Approximating the largest eigenvalue of network adjacency matrices 
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The largest eigenvalue of the adjacency matrix of a network plays an important role in several 
network processes (e.g., synchronization of oscillators, percolation on directed networks, linear sta- 
bility of equilibria of network coupled systems, etc.). In this paper we develop approximations to the 
largest eigenvalue of adjacency matrices and discuss the relationships between these approximations. 
Numerical experiments on simulated networks are used to test our results. 
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I. INTRODUCTION 

In recent years, there has been much interest in the 
study of the structure of networks arising from real world 
systems [IJ. Another concern has been dynamical pro- 
cesses taking place on networks, and the impact of net- 
work structure on such dynamics. The largest eigenvalue 
of the network adjacency matrix has emerged as a key 
quantity important for the study of a variety of different 
dynamical network processes. In particular, this is the 
case for synchronization of coupled heterogeneous oscil- 
lators 0] , percolation on directed networks [3| , linear sta- 
bility of the fixed points of systems of network- coupled 
ordinary differential equations 0], and several other ex- 
amples in physics and chemistry 0, Q . In this paper we 
study methods of obtaining approximations to the max- 
imum eigenvalue for the adjacency matrix for the case of 
large complex networks. 

We consider a network as a directed graph with N 
nodes, and we associate to it an N x N adjacency matrix 
whose elements A+j are one if there is a directed link 
from i to j and zero otherwise (An = 0). We denote the 
largest eigenvalue of A by A (it may be shown that of 
all the eigenvalues of A, the one with largest magnitude 
is real and positive Q). Furthermore, we note that it is 
often the case that the largest eigenvalue is well separated 
from the second largest eigenvalue. 

The properties of A have been studied in the context 
of small or regular graphs ,5| and in classical Erdos- 
Renyi random graphs Q. However, the structure of 
real world networks is usually more complex, as evi- 
denced by the fact that the degree distribution in a 
large number of examples has been found to be highly 
heterogeneous (often following a power law |8j), where 
the out- degree and in- degree of a node i are defined by 



C* = E7=i^ arxddf = £}= 
tributions', P(d in ,d out ), P m {d ln ) 

and P out (d out ) = J P(d in ,d out )dd m , are defined as the 
probabilities that a randomly chosen node has degree d m 
and/or d out . If A = A T we say the graph is undirected. 
For an undirected graph each link serves as both an in 
and out link for each of the two nodes it joins and for 
any node i we have df 1 — d° ut = d,-. Thus in the undi- 
rected case we write P(d) to denote the corresponding 
degree distribution. The effect of the degree distribution 
on the largest eigenvalue of the adjacency matrix has 
been explored recently by Chung et al. @, who consid- 
ered a particular ensemble of random uncorrelated, undi- 
rected networks whose number of nodes N is large (see 
also [13, HH, (HI)- Here, by uncorrelated we mean that 
we regard the network to be a random draw from some 
ensemble of networks for which the joint probability dis- 
tribution of the node degrees Q(d\, d%, . . . , djsr) factors, 
Q(d 1 ,d 2 ,...,d N ) = P(d 1 )P(d 2 )...P(d N ). Chung et al. 
found that in the limit N — > oo these networks yield an 
expected largest eigenvalue that is determined by the ra- 
tio A of the second to first moment of the average degree 
distribution, 
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The 'degree dis- 
/ P(d in ,d out )dd out , 



X=(d 2 )/(d), 



(1) 



where (x) 
degree d„ 



= N 1 J2i=i x ii an d by the expected largest 
ax - Specifically, they found that 




A > d mnx log AT, 



in ax 

> Alog 2 iV. 
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Some previous results for dynamical processes in net- 
works have been stated in terms of the quantity A, for 
example, the synchronization threshold in the mean field 
theory of coupled oscillators in networks 0, [TH, [3| and 
the network percolation and epidemic spreading thresh- 
olds [HHH. 

Real world networks often have some amount of degree- 
degree correlations [l7|, i.e., a node of a given degree is 
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more likely to be connected to nodes with particular de- 
grees than would be expected on the basis of chance. 
Networks in which high degree nodes connect prefer- 
entially to high (low) degree nodes, and viceversa, are 
called assortative (disassortative) . Such correlations can 
affect dynamical process on networks, as has been demon- 
strated for example in epidemic spreading models and 
percolation 0, Gj| Hj • 

We also emphasize that the in- and out- degrees at 
a node can have different distributions [i.e., P in (d m ) ^ 
Pout{d out )], as has been noted for some corporate infor- 
mation and genetic networks [2l|, , and that there are 
potential correlations between the in- and out- degrees 
at the same node which can also significantly affect the 
largest eigenvalue. 

The rest of this paper is organized as follows. SectionHII 
reviews the characterization of degree correlations. Sec- 
tion |TTT] develops the theory of the maximum eigenvalue A 
for the case of networks that satisfy a certain Markovian 
property. Some of the considerations of Sec. lIIII are sim- 
ilar to theory in previous papers, where, however, those 
previous considerations were application specific, and the 
more general applicability to the largest eigenvalue was 
not apparent. Section [IVI considers results for numerical 
constructions of network adjacency matrices of different 
types. 



II. DEGREE-DEGREE CORRELATIONS 
A. In and out degree correlations 

As we discussed in the introduction, we define 
P(d m ,d° ut ) as the probability that a randomly chosen 
node has in-degree d m and out-degree d out . 

As we shall see later, for networks without neighbor 
degree-degree correlations, a first order approximation 
to the eigenvalue A, generalizing Eq. ([1]), is given by 



coefficient, 



A = (d m d out )/(d), 



(3) 



where we recall that (•) denotes an average over nodes, 
and (d) means either (d-™) or (d° ut ) which are equal: 

If d\ n and d° ut are independent, the largest eigenvalue 
is approximately given by A = (d), and if d\ n and d° ut 
are perfectly correlated, so that d\ n — d° ut = di (e.g., 
as in an undirected network), the largest eigenvalue is 
approximately A = (d 2 )/(d). We see that correlations 
between d m and d out can crucially affect the eigenvalue, 
especially if the second moment of either degree distribu- 
tion diverges with increasing N, while the first moment 
converges: in such a case independence leads to a finite 
eigenvalue estimate, and perfect correlation to a diverg- 
ing eigenvalue estimate. 

In order to quantify the correlations between d m and 
d out at a node, we define the node degree correlation 



rj = (d m d out )/(d) 



(4) 



Note that if there are no correlations, r\ = 1, and r\ is 
larger (smaller) than I for positive (negative) correla- 
tions. In terms of the correlation coefficient, A is given 
by 



\ = v (d). 



(5) 



For undirected networks A is a symmetric positive ma- 
trix, and thus its largest eigenvalue satisfies 



A > 



T ' 

i 1 



(6) 



for any N- vector q. Choosing the components of q to be 
zero for nodes not connected to the node of largest degree 
d m ax, to be one for nodes connected to the node of largest 
degree, and y 'd max for the node of largest degree, Eq. © 
yields 



A > \fd 

n 



(7) 



if Vd max 5* A, the mean-field approximation ([3]) must 
be incorrect. Thus, for undirected networks we use as a 
heuristic alternative to the mean-field approximation, 



A w max{A, \/d max }, 



(8) 



which is consistent with both of the regimes considered 
in the rigorous result ([2]). (We remark, however, that 
Eq. ([H]) holds in principle only if the ratio A/ \Jd max or 
its inverse is large enough [see Eq. ([2])]). One way of 
viewing Eq. ([5]) is that Eq. © implies that A is at least 
as large as the maximum eigenvalue of any subnetwork 
of the original network (by a subnetwork we mean one 
obtained by deleting links of the original network) . Con- 
sidering a subnetwork consisting of the node of maximum 
degree and the nodes connecting to it, shown in Fig. Q] 
(a), Eq. ([7]) corresponds to the fact that y 'd max is the 
maximum eigenvalue of this star network. The regime 
A ~ V dmax in Eq. (|8|) applies to networks whose largest 
eigenvalue is dominated by the node with largest degree. 

We now contrast the above situation for undirected 
networks with what can happen for directed networks. 
We first note that the reasoning leading to ([5]) may not 
hold. For example, Eq. ([6]) no longer applies, and a node 
with many in and out links does not necessarily make a 
subnetwork with large eigenvalue. Regarding the latter 
point, consider Fig.[T](b) as compared to Fig. [TJa). This 
directed network has all its eigenvalues zero, because no 
pair of in and out links connect the same two nodes. 
Moreover, if one views d ln and d out to diverge as N in- 
creases at a rate sufficiently more slowly than N, then, 
as N approaches infinity, the probability of an out link 
and an in link from the central node connecting to the 
same peripheral node goes to zero with increasing A, if 
these peripheral nodes are randomly chosen. 
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FIG. 1: Undirected (a) and directed (b) star networks illus- 
trating the fact that, in the directed case, a node with many 
in and out links does not necessarily constitute a subnetwork 
with large eigenvalue (see text). 



of length 1 (or the number of links). Accordingly, let 
us write J2i,j A ij dfdf 1 = n 3 , N(d m d out ) = n 2 , and 
N(d) = n\. With this notation, the coefficients 77 and p 
can be rewritten as 



V 



n 2 
ni(d) ' 



P = 



n 3 ni 



//.-, 



(13) 



(14) 



As an example of this interpretation, we note that for net- 
works with uncorrelated in- and out-degrees, the number 
of paths of length two is n 2 ~ n 1 (d), and so i] w 1. For 
networks where there are no neighbor degree correlations, 
the number of paths of length three, 71,3, can be obtained 
from the number of paths of length two, n 2 , times the 
average branching ratio given by n 2 /ni, and thus p w 1 
for such networks. 



B. Neighbor degree correlations 

For our subsequent analysis it is useful to introduce an 
edge degree correlation coefficient p characterizing the 
correlation between the in-degree at node i and the out- 
degree at node j, where a directed link goes from i to 
3, 

p=(d?d° u %/(dr) e (d° ut ) e , (9) 
where (Qij) e denotes an average over edges, 

i,j hi 

Using (|10p we have 

<c > e = E E A v = E c*^/ E A » 

= (d out d m )/(d), (11) 

and so 

p =Hfw- ' (12) 

(Our definition of the edge degree correlation coefficient 
p is related to but slightly different from that of Newman 
0.) 

We note that the term J^i j Aijd\ n d° ut that appears 
in the definition of (d\ n d° ut ) e is the number of directed 
paths of length three, i.e., the number of links of the form 
tl — ► i — ► j — ► 7n. Similarly, (d™) e N(d) = {d° ut ) e N{d) = 
N(d m d out ) = J2 k d °k Utd T = Ei, j>k A ik A kj is the number 
of paths of length 2, and N{d) is the number of paths 



III. LARGEST EIGENVALUE OF MARKOVIAN 
NETWORKS 

A. Formulation 

For generality, in this section we allow different types 
of network nodes, where we specify the node type by an 
index a — 1,2,..., M, where M is the total number of 
possible node types (e.g., in the case of social networks 
connecting people, a might label sex, race, social class, 
etc.). Furthermore, we introduce the quantity 

z = (d m : d out ,a), (15) 

which we refer to as the degree. With this definition, we 
use P(z) to denote the degree distribution, i.e., P(z) is 
the probability that a randomly chosen node has degree 
z. This implies, for example, that 

E P{z) = N a /N, (16) 

where N a is the number of nodes of type a and N = 
J2<jN<7- (While our numerical examples in Sec. IIVI are 
for the case of a single node type, M — 1, the subsequent 
considerations in the present section do not have this 
restriction.) 

We consider a particular class of networks for which 
the only nontrivial correlations are between nodes that 
are directly connected by a single link. Such Markovian 
networks have been considered in previous works on epi- 
demic spreading and percolation |l5|, [l6|, [TH, [2(| ■ Under 
this assumption, if we define P{z'\z) to be the probability 
that a node with degree z has an outgoing link to a node 
with degree z', then if we choose a random outward path 
of length two from a node with degree z, the probability 
that the first hop ends on a node with degree z' and the 
second on a node with degree z" is, by this assumption, 

P(z',z"\z) = P{z"\z')P(z'\z). 
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Let ipz be the expected number of directed paths 
of length m whose starting node has degree z. Using 
the assumption that the network is Markovian, we can 
express tp 



(m+l) 



in terms of tpi 1 as 



^(m+l) = d o 



(m) 



(17) 



The number of paths of length m grows, in the limit of 
large m, as the largest eigenvalue A, ip z ™^ ~ ipzX m @- 
Therefore, we can associate to Eq. (TT71) the eigenvalue 
problem 



(18) 



and we seek its largest eigenvalue Ac and its correspond- 
ing eigenfunction ip z . We consider Ac to be an approx- 
imation to A that is more accurate than the mean field 
result in that it includes correlations between connected 
nodes, A w Ac- Additionally, an approximation uc to 
the right eigenvector u of A can be obtained in terms of 
ipz in a similar way: 



(u c )i = tp Zi , 



(19) 



and an analogous expression holds for the left eigenvector 
v of A, using the left eigenvector of the matrix d out P(z'\z) 
instead of ipz- We will refer to Eqs. (fl~8|) and (fT9j) as the 
Markovian approximation. 



B. 



e expansion 



While in many cases Eq. (jTHJ) can be solved directly, it 
is also of interest to explore approximations to its so- 
lution. Thus, we will here expand (| 18(1 about a zero 
order approximation in which degree-degree correlations 
are neglected and in which the first order correction gives 
the perturbation to the uncorrelated case due to a small 
amount of degree-degree correlations. In particular, we 
will be interested in zeroth order and first order approx- 
imations to the largest eigenvalue of A and to the corre- 
sponding right and left eigenvectors u and v, where 



Au = Ait and v A = Xv 



(20) 



Following previously used terminology, we refer to the 
zeroth approximation as the mean field theory. 

When there are no neighbor degree-degree correlations, 
P{z'\z) becomes independent of z: 



p(zi\z) = P(z>) = (d m yp(z>)/{d), 



(21) 



where P(z) is the degree distribution. We will expand 
Eq. (TTSJ) about the uncorrelated case. For this purpose, 
we will write 



P(z'\z) = P(z') + edP(z'\z), 



(22) 



where 



SP(z'\z) = P{z'\z) - P{z'), 



(23) 



and e is an expansion parameter that we formally con- 
sider small, although, in reality, e = 1. Introducing ex- 
pansions for Ac and ip z , 



A c = A + e<5A + . . . 



(24) 



?Pz =*Pz+ eSip z + ... (25) 
Expanding Eq. (|18p to zero order in e, we obtain 

i> z = d out , (26) 

and 

A= (d m d out )/(d). (27) 

Thus in the zeroth order approximation, the right eigen- 
vector of A has components 



u* = «*?**. 



(28) 



To obtain the left eigenvector, we follow the same steps 
as above but with A replaced by A T . This interchanges 
the roles of d m and d out , thus yielding 



for the left eigenvector. 

Expanding to first order in e, we obtain 



(29) 



\5i> z + <5AV> 2 = d out P{z')Hz> + d out SP{z'\z)4> z ,. 

z' z' 

(30) 

Multiplying by P(z) and summing over z we obtain, after 
some simplification, 



SX 



(A(d))- 1 J](d 0Ut )'d "*d m P(z'|z)P(z) 



A (31) 



The probability that a randomly chosen edge starts at 
a node with degree z is P(z) = (d)^ 1 P(z)d out , and, 
therefore, the term (d)^ 1 Y^ z . zl {d out )' d out d ln P{z'\z)P{z) 
is equal to 



J2d m (d out )'P(z')P(z'\z) = (d^d 

z,z' 



'OUt^ 



(32) 



To first order, therefore, we obtain 

A c ps A + S A = - I = Xp, 
A 



(33) 



where p is defined in Sec. ITU and we call A = Xp the linear 
approximation. 
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FIG. 2: Schematic representation of the rewiring algorithm. 
Edges i —* j and n — > m were chosen at random. When cre- 
ating a network with assortative (disassortative) correlations, 
they are replaced with edges i — > m and n — » j if the resulting 
network has a larger (smaller) value of p. 



Similarly, 

5ip z = A -1 ^ 5P(z'|z)(rf°"*)'. (34) 

z' 

We now briefly comment on the range of validity of the 
expansion results. As the rigorous result Eq. © shows, 
the mean field result for undirected networks is not valid 
when the network is dominated by the node with maxi- 
mum degree. A network in which the maximum degree 
is too large compared with the bulk of the degree distri- 
bution can not satisfy Eq. ([2"Tj). For example, the star 
network of Fig. QJa) is very degree-degree correlated be- 
cause all of its outer nodes connect only to the high de- 
gree hub node and, indeed, application of Eq. (|2"Tj) to 
this simple star network yields an incorrect result. Our 
expansions about an uncorrelated network therefore im- 
plicitly assumed that this network was not dominated 
by the node with maximum degree. We speculate that 
directed networks might satisfy Eq. (|27p with a less re- 
strictive condition on the maximum degrees, as argued 
in Sec. |TT] and Fig. [TJ A more rigorous delineation of 
the range of validity of the Markovian approximation for 
correlated and/or directed networks along the lines of 
Ref. @ is open for further research. 

IV. NUMERICAL TESTS ON SIMULATED 
NETWORKS 

In this section we numerically construct random net- 
works and use them to compare with the predictions of 
Sec. IIIII We shall restrict ourselves to networks which 
are not dominated by nodes with large degree, as dis- 
cussed at the end of Sec. IIIIB1 In order to study the 
variation of the largest eigenvalue A of the adjacency 
matrix A, we first construct large (N S> 1) approxi- 



mately uncorrelated networks with a given expected de- 
gree distribution using a generalization of the Random 
Graph model of Chung et al. [?|. First we associate to 
each node k target in and out degrees (d™ , d™ 1 ) , where 
Efc 4" = Efe d° k ut = N(d). Note that the choice of tar- 
get node degrees can be made to correspond to a desired 
degree distribution (e.g., scale free, or Poisson). We then 
choose each element of the adjacency matrix Ay ran- 
domly to be one with probability d° ut d™ / (N(d)) and zero 
otherwise. This determines a network realization with in 
and out degrees d° ut = J2j Aij and d™ = ^Ay. In 
general, (djfe 1 , d™*) can be different from the target val- 
ues (d™, d1 ut ). Nevertheless, with high probability, for 
large N, the resulting degree distribution P(d m , d out ) of 
this randomly chosen network will be approximately, in 
a suitable sense, the target distribution [9|]. In particu- 
lar, the moments (d m d out ) and (d) will be approximately 
unchanged when calculated using either P or the target 
degrees. Furthermore, p w 1 for N S> 1. The net- 
work generated by this algorithm is directed (i.e., A is 
asymmetric); symmetric networks can be generated by 
first considering only i < j and then setting Aji = Aij. 
Starting from an uncorrelated network generated by this 
algorithm, we then rewire the connections in such a way 
that the degree distribution is preserved. On doing so, 
the correlation coefficient changes and we calculate the 
largest eigenvalue for different values of p. The rewiring 
algorithm we use is a simplified version of that used in 
and consists of the repeated iteration of the following 
steps: 

1 . Two edges are chosen at random. Assume one con- 
nects node n to node m and the other connects 
node i to node j. 

2. Let 

H(iJ;n,m) = d™d% t +d?d? ut -di n d™ t -4 n d% t . (35) 

The two edges chosen in step 1 are replaced with 
two edges connecting node n to node j and node i to 
node m (see Fig. [2]) if sH(i, j; n, m) < 0, and are left 
alone otherwise. Setting s = 1 or —1 we produce 
assortative or disassortative networks, respectively. 

As we iterate from p = 1 with s = 1 (s = — 1), p 
steadily increases (decreases). Thus, we produce a se- 
quence of networks with successively larger (smaller) p. 

In Fig. [3] a) we show the largest eigenvalue A (solid 
line), the linear approximation given by Eq. (|33[) (dashed 
line), and the Markovian approximation Ac from Eq. (|18|) 
(boxes) as a function of the correlation coefficient p for an 
undirected network with power law degree distribution 
generated as described above with N — 25000, (d) = 
100, and exponent 7 = 2.5. In Fig. [3] b) we plot the 
same quantities for a directed network with N = 10000, 
(d) = 20, and exponent 7 = 2.5. In these plots there is 
no discernible difference between the approximation Ac 
and the actual values of A. We observe that the largest 
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V. CONCLUSION 

In this paper we have considered several approxima- 
tions to the largest eigenvalue of the adjacency matrix of 
large, directed networks. The mean field result (J3J) ap- 
pears to apply well to networks whose neighboring nodes 
are uncorrelated in their degrees. The linear approxima- 
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FIG. 4: Entries of the eigenvector u corresponding to A versus 
the entries of its Markovian approximation uc [Eq. (|19p ] in 
an arbitrary scale for the network of Fig. |3]b) at p ~ 0.9. 



FIG. 3: Largest eigenvalue A (solid line), linear approxi- 
mation [Eq. (|33p . dashed line], and Markovian approxima- 
tion Ac [Eq. Q5) , boxes] for a) an undirected network with 
AT = 25000, 7 = 2.5 and (d) = 100, and b) a directed network 
with N = 10000, 7 = 2.5 and (d) = 20. 



eigenvalue depends strongly on the correlation coefficient: 
in the undirected case, it increases more than three times 
as p varies from 0.4 to 1.7. Also, we see that in these 
examples the linear approximation works for \p— 1| < 0.2, 
but fails for larger values of \p— 1|. In the undirected case, 
A is larger than the linear approximation, which follows 
from Eq. © if we set qi — di . 

In Fig. 3] we show the eigenvector Ui for the network 
of Fig. [3]b) at p ss 0.9 plotted against the corresponding 
approximation (uc)i, Eq- (O using an arbitrary scale. 
There is good agreement between the true value and its 
Markov estimate. 



the Markov model (fT5)) applies for arbitrarily strong de- 
gree correlations between neighbors. The price to be paid 
for a more refined approximation is the requirement of 
greater knowledge of the network (e.g., use of (p~8|) re- 
quires knowledge of P(z\z') which is not required for the 
two other less refined approximations). 

We caution that, although we have obtained good 
agreement between the theory and numerical results 
on simulated networks, this may not necessarily carry 
through for real networks encountered in practice. In 
particular, the Markov assumption of Eq. (|17[) may not 
always hold (e.g., due to community structure [231 ] . clus- 
tering, or degree correlations extending over more than 
one link between nodes). This remains a topic for further 
study. 

This work was supported by ONR (Physics), by the 
NSF (PHY 0456240 and DMS 0104-087), and by AFOSR 
FA95500410319. 
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